estimating the Circular Probable Error, more commonly referred to as
the CEP. A comparison is made between the estimates of the CEP under
two distinct models. The models are identical except for the location
of the mean vector in relation to the target. The assumption of
dependence is made in both models and the resulting estimates are
compared with the corresponding estimates obtained under the assumption
of independence. Confidence interval estimates of the CEP are also
presented. Two methods of removing outlier or "maverick" observations
are introduced and some of the possible effects on the estimated CEP
are discussed. The different estimating procedures are illustrated with
three numerical
assumed before making judgement on the legality of any assumption. This
thesis, therefore, attempts to explain such assumptions and to compare
possible results of making certain assumptions in three example
problems. The problems are all fictitious and utilized only for the
purpose of explaining the estimating procedures and assumptions.
and III introduce the most commonly used estimating procedures. Section
IV explains the problem of deleting outlying observations from the
determination of the estimate. Section V introduces the confidence
interval. Section VI is a summary of the techniques used in the previous
sections. Appendix A is concerned with the mathematical techniques which
are used to explain and transform the true orientation of the dependent
variables. Appendix B explains two methods of obtaining unbiased
estimates of the CEP. Appendix C explains in detail the methods of
integration used.

It is recommended that Appendices A and C be studied before starting
Section II.

This thesis was written during the period January-June 1962 at the
United States Naval Postgraduate School, Monterey, California. I wish
patience, encouragement, and most competent guidance while acting as
faculty advisor, and to Professor Max Woods for his continuous aid and
technical understanding of the problem while acting as second reader.

I also wish to thank my wife for the moral, clerical, and artistic
INTRODUCTION TO THE PROBLEM


1.1 General Discussion of the Circular Probable Error 

The problem of determining useful estimates of the parameters
which describe the distribution of the fall of shot about a target is
directly related to the high cost of testing expensive weapon systems.
Since relatively few tests are allowed because of this expense, it is
not improbable that a good weapon system could be completely rejected
because of inefficient utilization of the small amount of data available.
Also, the size and yield of the warhead are directly related to the
estimated parameters. If the estimated variance is large, the effective
radius will also have to be large to cover the target complex, and in
turn the missile will not be able to reach the range of the same missile
with a smaller warhead. The most efficient use of the limited data will
thus greatly reduce the risk involved in reducing the warhead size and
increase the potential range. It also may aid in weapon deployment or
assignment to larger targets because of the greater confidence that can
be placed in the estimates. It seems logical that if a great deal of
confidence can be placed in the weapon, fewer weapons will have to be
assigned to a target, thus releasing some weapons for other targets.
The important point is that the confidence placed on the estimators
must be high enough to reduce the risk involved and provide a sound
basis for decision.

One method, which is commonly used, to measure and compare the
estimated parameters, is called the circular probable error or CEP
method. The CEP is defined as the radius of the circle with center
at $(\mu_x, \mu_y)$ which includes 50% of a bivariate probability mass.
The illustration in figure (1) shows the form of this function. It is to
be noted that most of the volume under the curve is centered at the
target and decreases as the distance increases from the target. This
particular function is well founded historically on the basis of the
analysis of observations from long range gun fire.
Bivariate Probability Mass

Figure 1
The bivariate normal distribution is a generalization of the normal
distribution of a single variate and is bell shaped as shown in figure (1)
above. Any plane parallel to the x,y plane that cuts the surface will
intersect the bell in the elliptical curve shown in figure (2).
Bivariate Density Function which has been
Cut by a Plane Parallel to the x,y Plane.

Figure 2

Any plane perpendicular to the x,y plane will cut the surface in
a curve of the normal form as shown in figure (3).

Bivariate Density Function which has been Cut
by a Plane Perpendicular to the x,y Plane.

Figure 3
The bivariate density function actually represents a five parameter
family of distributions, the parameters being the means $\mu_x$, $\mu_y$,
the variances $\sigma_x^2$, $\sigma_y^2$, and the correlation coefficient
$\rho$. This function is symmetric about the means and has its greatest
value at the point $(\mu_x, \mu_y)$.
It should also be noted that if the errors in the x and y directions are
independent and the variances are unequal, then the distribution will be
in the shape of a bell with two of the opposite sides "pushed in" an
equal amount. The effect of the variance is shown in figure (4).
Two Bivariate Density Functions with
Different Variances about $(\mu_x, \mu_y)$: Side View

Figure 4

If the variances are equal, a plane cutting the surface, as in
figure (2), will intersect the bell in a circle.

The height of the curve, forming the density function, at any
point "a" is related to the probability of that point. Since this
function is continuous, the probability must be expressed in the form of
an interval since the probability of any single point is zero. However,
the probability that a random variable X, in the distribution being
considered, falls in an interval is equal to the area under the curve
in the interval being considered. That is, the probability that
$a \le X \le b$ is equal to the area shown under the curve in figure (5).
Note that since the area under the curve about the point $\mu_x$ is the
greatest, the probability that the random variable X falls in this
interval is greater than that of an interval of equal length away from
the point $\mu_x$. This is shown in figure (6).
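The comparison of two intervals of equal length can be checked numerically. The sketch below is only an illustration: it assumes a univariate normal density with mean 0 and standard deviation 1, and the interval end points are hypothetical values chosen for the example, not values from the thesis.

```python
import math

def normal_cdf(x, mean=0.0, sd=1.0):
    """Cumulative distribution function of a normal random variable."""
    return 0.5 * (1.0 + math.erf((x - mean) / (sd * math.sqrt(2.0))))

def interval_probability(a, b, mean=0.0, sd=1.0):
    """P(a <= X <= b): the area under the normal density between a and b."""
    return normal_cdf(b, mean, sd) - normal_cdf(a, mean, sd)

# Two intervals of equal length (hypothetical end points, mean 0, sd 1).
near_mean = interval_probability(-0.5, 0.5)      # centered on the mean
away_from_mean = interval_probability(1.0, 2.0)  # same length, farther out
print(near_mean, away_from_mean)
```

The interval centered on the mean carries the larger probability, which is the point made by the comparison of the two shaded areas.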








Univariate Density Function Showing the Area Under Consideration
When Determining $P(a \le X \le b)$.

Figure 5

Univariate Density Function Showing the Areas Under Consideration
in the Intervals (a,b) and (c,d) where b - a = d - c.

Figure 6


1.2 Mathematical Notation

X and Y are said to have a bivariate normal distribution if their
joint density function, $f_{X,Y}(x,y)$, is given by
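As a reference sketch, the standard textbook form of the bivariate normal density in the five parameters named in Section 1.1 is written out below. The symbols $\mu_x$, $\mu_y$, $\sigma_x$, $\sigma_y$ and $\rho$ are assumed here to match the notation of this section.

```latex
f_{X,Y}(x,y) = \frac{1}{2\pi\sigma_x\sigma_y\sqrt{1-\rho^2}}
  \exp\left\{ -\frac{1}{2(1-\rho^2)}
  \left[ \frac{(x-\mu_x)^2}{\sigma_x^2}
       - \frac{2\rho(x-\mu_x)(y-\mu_y)}{\sigma_x\sigma_y}
       + \frac{(y-\mu_y)^2}{\sigma_y^2} \right] \right\}
```

With $\rho = 0$ the cross term vanishes and the density factors into the product of two univariate normal densities.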
The quantity x is said to be an observed value of a numerical
valued random phenomenon X if for every real number x there exists a
probability that X is less than or equal to x. In this problem the
observed values of the random variables X and Y are the coordinates
of the data points with respect to the target. These coordinates can
also be referred to as miss distances in and across the line of sight.








The parameters $\mu_x$ and $\mu_y$ are the mean values in the x and y
directions respectively. The mean of a probability law is equivalent
to the expected value of the random variable with respect to the
probability law. This is written as:
The mean value cannot be determined exactly in our problem even if all
of the missiles have been fired but estimates of the mean values can
be determined from the observations.

The expressions $(x - \mu_x)$ and $(y - \mu_y)$ are the deviations from the
mean values in the x and y directions respectively.

$\sigma_x$ and $\sigma_y$ are the standard deviations in the x and y
directions respectively. The standard deviation is defined as the square
root of the variance of the probability law. The variance $\sigma^2$ is
defined as the second central moment of the probability law and is
defined by:

$\sigma^2 = E[(X - E(X))^2] = E[(X - \mu)^2] = E(X^2) - \mu^2$
It should be noted that the mean values determine the location
$(\mu_x, \mu_y)$ of the center of the normal density function and the
standard deviations ($\sigma_x$ and $\sigma_y$) determine the shape of the
function about the mean in the x and y directions respectively.
The correlation coefficient of two jointly distributed random
variables X and Y is defined by

$\rho = \frac{\mathrm{Cov}(X,Y)}{\sigma_x \sigma_y}$

where

$\mathrm{Cov}(X,Y) = E(XY) - E(X)E(Y)$

$E(XY) = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} x\,y\,f_{X,Y}(x,y)\,dx\,dy$
os yt 


The correlation coefficient provides a measure of how good a
prediction can be formed on one of the random variables on the basis of
the observed value of the other random variable. In other words, if the
value of one of the random variables is given, the expected value of
the other random variable can be determined. This may be written as
$E(X \mid Y)$ where the value of Y is given. That is,

(1.6)  $E(X \mid Y) = \int_{-\infty}^{\infty} x\, f_{X|Y}(x \mid y)\,dx$

where $f_{X|Y}(x \mid y)$ is the conditional density function of the
random variable X given the value of the random variable Y. The
conditional density function is derived from the conditional probability
of a random event A, given a random variable Y. This notion forms the
basis of the mathematical treatment of jointly distributed random
variables that are not independent.¹

In the particular case where two random variables X and Y are
jointly normally distributed, the conditional expected value of the
random variable X given that the random variable Y is some particular
value y, is a linear function. This linear function is related to
the orientation of the shape of the density function as shown in
Appendix A.
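For reference, the standard result for the jointly normal case can be written out explicitly. The block below is a textbook identity stated in the notation of this section, not a formula reproduced from Appendix A.

```latex
E(X \mid Y = y) = \mu_x + \rho\,\frac{\sigma_x}{\sigma_y}\,(y - \mu_y),
\qquad
E(Y \mid X = x) = \mu_y + \rho\,\frac{\sigma_y}{\sigma_x}\,(x - \mu_x)
```

Both expressions are straight lines through $(\mu_x, \mu_y)$. When $\rho = 0$ they reduce to the constants $\mu_x$ and $\mu_y$, so the observed value of one variable gives no information about the other.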

¹ "Modern Probability Theory and Its Applications" by Emanuel Parzen
[1] of Stanford University.
In order to simplify the notation, it will be convenient to
represent the bivariate density function in matrix notation. The
terms in formula (1.1) are first arranged in the form
Using this notation, we are now ready to look at several models
for investigating the CEP and confidence interval of the CEP.

1.3 The Basic Problem in Estimating the CEP


The problem of estimating the CEP is essentially that of finding
the radius r of a circle with center at $(\mu_x, \mu_y)$ such that the
probability that a random point (X,Y) will lie inside this circle is
one half. This can be expressed as

(1.8)  $P\{(X-\mu_x)^2 + (Y-\mu_y)^2 \le r^2\} = \iint_{(x-\mu_x)^2+(y-\mu_y)^2 \le r^2} f_{X,Y}(x,y)\,dx\,dy$

where $f_{X,Y}(x,y)$ is given by formula (1.1).
In order to introduce the problem, the assumptions will be made
that the mean values are zero ($\mu_x = \mu_y = 0$), that the errors in
the x and y directions are independent ($\rho = 0$), and that the standard
deviations are equal ($\sigma_x = \sigma_y = \sigma$). The probability
statement is thus simplified to

(1.9)  $P\{X^2 + Y^2 \le r^2\} = \frac{1}{2\pi\sigma^2} \iint_{x^2+y^2 \le r^2} \exp\left\{-\frac{x^2+y^2}{2\sigma^2}\right\} dx\,dy = \frac{1}{2}$

In order to perform the integration, the transformation to polar
coordinates is made:

$x = r\cos\theta, \quad y = r\sin\theta$

Then $P(R \le r) = \iint f_{X,Y}[(r\cos\theta),(r\sin\theta)]\,|J|\,dr\,d\theta$
where $J = \partial(x,y)/\partial(r,\theta) = r$.

Therefore, the CEP = r = 1.1774$\sigma$.
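The constant 1.1774 can be checked directly. For zero means, independent errors and equal standard deviations, the polar integration gives $P(R \le r) = 1 - \exp(-r^2/2\sigma^2)$, and setting this equal to one half gives $r = \sigma\sqrt{2\ln 2}$. The short sketch below evaluates that expression; the function names and the value of sigma are illustrative assumptions only.

```python
import math

def circular_cep(sigma):
    """CEP for zero means, independent errors and equal standard deviations.

    Solves 1 - exp(-r**2 / (2 * sigma**2)) = 0.5 for r.
    """
    return sigma * math.sqrt(2.0 * math.log(2.0))

def prob_within_radius(r, sigma):
    """P(X**2 + Y**2 <= r**2) for the circular normal case."""
    return 1.0 - math.exp(-r * r / (2.0 * sigma * sigma))

sigma = 1.0  # hypothetical standard deviation, in any unit of distance
cep = circular_cep(sigma)
print(round(cep, 4))                   # 1.1774
print(prob_within_radius(cep, sigma))  # probability inside the CEP circle
```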

The problem of estimating the CEP is thus one of obtaining a
function of the n sample points $(x_1,y_1), \ldots, (x_n,y_n)$ which will
estimate the standard deviation $\sigma$. The estimators are functions of
the observed values which are used to estimate the true values of the
parameters. For example, if m points from a sample are given, the
average or mean value is estimated by

$\bar{X} = \frac{1}{m}\sum_{i=1}^{m} x_i = \frac{x_1 + x_2 + \cdots + x_m}{m}$

The distribution of $\bar{X}$ becomes closely concentrated about the true
value $\mu_x$ as m becomes large.





There are many ways to estimate the parameters under investigation,
and it is therefore necessary to specify certain properties which are
desired in estimators. For example, the distribution of the estimator
should be concentrated near the true parameter value. If $\hat\theta_1$
and $\hat\theta_2$ are different estimators of $\theta$ with density
functions $f_1(\hat\theta_1)$ and $f_2(\hat\theta_2)$ as shown in figure (7),
then the estimator whose density is more closely concentrated about
$\theta$ is the better estimator of $\theta$.

Density Functions of Two Estimators

Figure 7
Other properties which are desired in estimators are defined as follows:

1.3.1 Relative Efficiency. The relative efficiency of two estimators
is defined as a ratio of the mean square errors of the estimators.
That is,

$\frac{E(\hat\theta_1 - \theta)^2}{E(\hat\theta_2 - \theta)^2} = R.E.$, where R.E. is the relative efficiency.

If R.E. < 1, then $\hat\theta_1$ is said to be a more efficient estimate
of $\theta$ than $\hat\theta_2$.
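The ratio of mean square errors can be approximated by simulation. The sketch below is an illustration only: it assumes a normal population with unit standard deviation, compares the sample mean with the sample median as two estimators of the center, and uses hypothetical values for the sample size and number of trials.

```python
import random
import statistics

def mean_square_error(estimator, true_value, n, trials, rng):
    """Monte Carlo estimate of E[(estimate - true_value)**2]."""
    total = 0.0
    for _ in range(trials):
        sample = [rng.gauss(true_value, 1.0) for _ in range(n)]
        total += (estimator(sample) - true_value) ** 2
    return total / trials

rng = random.Random(1962)  # fixed seed so the run is repeatable
theta = 0.0                # hypothetical true parameter value
mse_mean = mean_square_error(statistics.mean, theta, n=10, trials=4000, rng=rng)
mse_median = mean_square_error(statistics.median, theta, n=10, trials=4000, rng=rng)
relative_efficiency = mse_mean / mse_median
print(relative_efficiency)  # below 1 when the sample mean is more efficient
```

With the sample mean in the numerator, a ratio below one indicates that the sample mean is the more efficient of the two estimators for this population.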


1.3.2 Unbiased Estimator. An estimator $\hat\theta$ is said to be an
unbiased estimate of the parameter $\theta$ if $E(\hat\theta) = \theta$.

1.3.3 Consistent Estimator. An estimator $\hat\theta$ is said to be a
consistent estimate of $\theta$ if $P(\hat\theta \to \theta) \to 1$ as
$n \to \infty$.

1.3.4 Efficient Estimator. The estimators which have the smallest
limiting variances are called efficient estimators of $\theta$.


The estimators which will be used in the first part of this thesis are
shown in Table a.


Table a

Parameter | Estimator | Properties
$\mu$ | $\hat\mu = \frac{1}{n}\sum_{i=1}^{n} x_i$ | Unbiased, efficient, and consistent.
$\sigma^2$ | $\hat\sigma^2 = \frac{1}{n-1}\sum_{i=1}^{n} (x_i - \bar{x})^2$ | Unbiased, efficient, and consistent.




A more detailed discussion of certain estimators under special
assumptions is presented in Appendix B.

1.4 The Problem of Dependence

In the gunnery problem, the errors introduced in the line of sight
are due to variations in the range and projectile initial velocity. The
error across the line of sight is due to bearing errors. Since bearing
errors and range errors are independent of each other due to the fact
that they are obtained from different sources, the mathematical
assumption is generally made that these errors are also statistically
independent. However, if we broaden the perspective to look at the major
errors introduced in a missile trajectory, the major errors in the line
of sight and across the line of sight are probably not independent of
each other.

This is primarily due to factors which did not especially influence
the gunnery fire control problem such as errors in ship's navigational
position, errors introduced by missile attitude during the time of
powered flight, especially at cutoff, and weather conditions over the
target.

In the gunnery problem there are two types of navigational problems.
The first is the relative problem of firing from a moving object to
another moving target where the fire control problem is one of obtaining
relative bearings, ranges, courses, and speeds. But the firing
ship's true navigational position relative to the target is not an
influence.

The second problem is one of shore bombardment where the ship's
navigational position is determined by visual fix. This is closely
related to missile launching except that the first shot in shore
bombardment does not have to hit the target because the shore observer
can tell the ship what spots to apply to the generating fire control
solution. Therefore, this again becomes a relative fire control problem
where errors introduced by the ship's and target's relative positions
are corrected by spotting. This is not practical in long range missile
launching because of the inability to obtain corrected visual
navigational positions relative to the target due to lack of observers
at the target area. What is done instead is that the probable errors
must be predetermined and enough missiles launched to give a high
probability of destruction of the target complex. If we assume that the
launching ship is determined to be at the launch reference point then
the errors introduced are as shown in figure (8).


Byrg = Firing bearing from ship to the position the target
will have at detonation.

True Target Bearing Diagram

Figure 8


Byre is proportional to Hp ene Ser f 
LyYONE 


oe 
5 
ee) 


Pissproportional. to Lyong + Lxons { 
Lyong + Lyong 


Since Byre differs from Byrg by the errors introduced in and
across the line of sight, the errors are reflected in the q(Jgm)s
interpolation computer as errors in velocity to be gained, which have
not been entered. But the errors introduced are not independent because
the inputs influence changes in velocity to be gained in both range and
cross range directions as shown below:

Flow Diagram of the Change in Velocity to be Gained
Figure 9

In the gunnery problem, the weather conditions over the firing
ship's position are the same as the weather conditions over the target.
Therefore these values can be accurately estimated. The missile firing
ship depends upon intelligence and weather forecasts to predict the
values for target weather conditions. This information is therefore not
as accurate as in the gunnery problem. Since the errors introduced by
weather predictions influence the missile trajectory over the target,
the re-entry body is most likely to be moved in any direction and the
probability that the errors in and across the line of sight are
independent of each other is low.


The errors introduced by missile attitude during cutoff can best
be illustrated by a vector diagram.

Vector Diagram of Velocities at Cutoff
Figure 10

The missile attitude at cutoff can be regarded as a random variable
because it can assume any attitude due to the fact that the requirements
to initiate cutoff are due to past and present missile velocity and not
to a predicted velocity at some $\Delta t$ after cutoff. Thus the errors
introduced during the $\Delta t$ of cutoff will influence the errors in
and across the line of sight in a random manner. Therefore, the
probability that the errors in and across the line of sight are
independent is again slight.

The conclusion is that due to the complexity of the fire control
problem, the errors in and across the line of sight are probably not
independent. If we approach the problem with this assumption and the
resultant increase in accuracy gained by this model is not sufficient to
justify the increase in the mathematical difficulty, an independent
model can be used.
1.5 Summary

Section one has been used to set up the environment of the problem
which is to be analyzed. The basic assumption made is that the fall of
shot about the target is a random variable which obeys the bivariate
normal probability laws. The assumption has been made that the errors
in and across the line of sight are not independent and one of the
objectives of this paper is to determine the effect of this assumption
on the CEP.
SECTION II
DETERMINING THE CEP WITH THE DENSITY FUNCTION CENTERED
AT THE TARGET: MODEL I
2.1 Introduction

The most important assumption made in this model is that $\mu_x$ and
$\mu_y$ are zero. This means that the center of the bivariate density
function is at the target. Although this is the desired condition, it may
not be true initially due to the complexities of the fire control
problem. One of the determinations that is made from the analysis of the
firing data is whether a correction should be made to the fire control
solution to center the distribution of the fall of shot over the target.
Therefore, by starting with the assumption that the center of the
distribution is at the target and finding that this assumption is wrong,
it becomes necessary to determine and apply the correction to the fire
control solution. Also, it should be noted that although this assumption
may not be true initially, it still may be true after correcting the
initial fire control solution.

If the center of the distribution is close to the target, (0,0) in
the coordinate system, or suspected of being so by analysis of the test
data, the estimators determined from this model may be better estimators
than the estimators used in Model II in Section III. A comparison can be
made between Model I and Model II, using the criterion of relative
efficiency, to determine which model is theoretically the best. This
criterion is explained in Section III.

In this model the errors in the x and y direction are assumed to
be non-independent and distributed in accordance with the bivariate
normal probability law. The probability that a random point (X,Y) will
lie within a circle of radius $k\sigma_{\max}$ is equal to

(2.1)  $P(X^2 + Y^2 \le k^2\sigma_{\max}^2) = \iint_{x^2+y^2 \le k^2\sigma_{\max}^2} f_{X,Y}(x,y)\,dx\,dy = \frac{1}{2\pi\sigma_x\sigma_y\sqrt{1-\rho^2}} \iint_{x^2+y^2 \le k^2\sigma_{\max}^2} \exp\left\{-\tfrac{1}{2} Z'AZ\right\} dx\,dy$

where Z and A are defined in (1.7).

In order to integrate over this form, it is necessary to first make
a transformation to an orthogonal density function. The reason for this
is that due to the assumption of non-independence ($\rho \ne 0$), this
density function is oriented along non-orthogonal lines called the
expected value of X given Y and the expected value of Y given X or in
simpler notation $E(X \mid Y)$ and $E(Y \mid X)$ as defined in Section
1.2.5. This orientation is illustrated in figures 11a and 11b.

Three Dimensional Diagram of the Orientation of the Bivariate Density
Function where 0 < $\rho$ < 1, a + b + c = 90°.
Figure 11a

Two Dimensional Diagram of the Bivariate Density Function Formed by a
Plane Parallel to the x,y Plane Cutting the Density Function.
Figure 11b
This reorientation is shown to be valid by proving that

The transformed density function thus becomes

$f_{U,V}(u,v) = \frac{1}{2\pi\sigma_u\sigma_v} \exp\left\{-\frac{1}{2}\left[\frac{u^2}{\sigma_u^2} + \frac{v^2}{\sigma_v^2}\right]\right\}$

The reoriented axes are now as shown in figures 12a and 12b.

Details of this transformation are contained in Appendix A.5.
This transformed density function can be handled more easily
because the terms involving the correlation coefficient have been
removed. The probability that a point (U,V) in the new coordinate system
will lie within a circle with center at the origin and radius
$k\sigma_u$ is

(2.5)  $P(k,\sigma_u,\sigma_v) = \iint_{u^2+v^2 \le k^2\sigma_u^2} f_{U,V}(u,v)\,du\,dv = P(k,c) = \frac{1}{2\pi\sigma_u\sigma_v} \iint_{u^2+v^2 \le k^2\sigma_u^2} \exp\left\{-\frac{1}{2}\left[\frac{u^2}{\sigma_u^2} + \frac{v^2}{\sigma_v^2}\right]\right\} du\,dv$

where $c = \sigma_v/\sigma_u$. This form is simplified in Appendix C.5 to

(2.6)  $P(k,c) = \frac{2c}{\pi} \int_0^{\pi} \frac{1 - \exp\left\{-\frac{k^2}{4c^2}\left[(c^2+1) + (c^2-1)\cos\phi\right]\right\}}{(c^2+1) + (c^2-1)\cos\phi}\,d\phi$


The values of (k) for various values of P(k,c) and (c) are tabulated in
tables one and two. Table one is used by entering the table with
$c = \sigma_v/\sigma_u$ in order to find k. This table can only be used
for P(k,c) = .5.
Table two is used by entering the table with $c = \sigma_v/\sigma_u$ and
the probability P(k,c) in order to find k. This table can be used for
various values of P(k,c).
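Where the tables are not at hand, a value of k can be approximated numerically. The sketch below is not the tabulation method of the thesis. It assumes the single-integral form that results from integrating (2.5) in polar coordinates with $c = \sigma_v/\sigma_u$, namely $P(k,c) = \frac{2c}{\pi}\int_0^{\pi} \frac{1 - \exp\{-k^2[(c^2+1)+(c^2-1)\cos\phi]/4c^2\}}{(c^2+1)+(c^2-1)\cos\phi}\,d\phi$, evaluates it by the midpoint rule, and finds k by bisection. The function names and step counts are illustrative choices.

```python
import math

def prob_in_circle(k, c, steps=2000):
    """P(k, c) from the single integral over phi in [0, pi] (midpoint rule)."""
    h = math.pi / steps
    total = 0.0
    for i in range(steps):
        phi = (i + 0.5) * h
        d = (c * c + 1.0) + (c * c - 1.0) * math.cos(phi)
        total += (1.0 - math.exp(-k * k * d / (4.0 * c * c))) / d
    return 2.0 * c / math.pi * total * h

def solve_k(p, c, lo=0.0, hi=10.0, tol=1e-8):
    """Bisection for the k with P(k, c) = p; P increases with k."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if prob_in_circle(mid, c) < p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

print(round(solve_k(0.5, 1.0), 4))  # 1.1774, the circular case c = 1
```

For c = 1 the integral reduces to $1 - \exp(-k^2/2)$, so the result agrees with the circular case of Section 1.3.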


2.2 Estimating the CEP using Model I

The first step is to find estimators for $\sigma_x$, $\sigma_y$ and
$\rho$ from the n observed points $(x_1,y_1), \ldots, (x_n,y_n)$. This is
done by computing the sample variances $\hat\sigma_x^2$, $\hat\sigma_y^2$,
the sample covariance $\hat\sigma_{xy}$, and the sample correlation
coefficient $\hat\rho$ which are defined as follows:

$\hat\sigma_x^2 = \frac{1}{n}\sum_{i=1}^{n} x_i^2, \quad \hat\sigma_y^2 = \frac{1}{n}\sum_{i=1}^{n} y_i^2, \quad \hat\sigma_{xy} = \frac{1}{n}\sum_{i=1}^{n} x_i y_i$

$\hat\rho = \frac{\hat\sigma_{xy}}{\hat\sigma_x \hat\sigma_y}$
In these formulas, $\hat\sigma_x^2$, $\hat\sigma_{xy}$ and $\hat\sigma_y^2$
are unbiased estimates of $\sigma_x^2$, $\sigma_y^2$

The transformed estimates of the variances, $\hat\sigma_u^2$ and
$\hat\sigma_v^2$, are computed next.
Table two is entered with P(k,c) = .5 and $\hat{c} = \hat\sigma_v/\hat\sigma_u$
to find k.

The estimate of the CEP = $k\hat\sigma_u$.
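The steps of this section can be sketched in code under stated assumptions. The block below assumes that, because Model I takes the means to be zero, the sample moments are computed about the target with divisor n, and that the transformation to the u,v system is a rotation to the principal axes, so that the transformed variances are the eigenvalues of the estimated covariance matrix. Taking u as the axis of larger variance is also an assumption, and the sample points are hypothetical.

```python
import math

def model_one_estimates(points):
    """Sample moments about the target (0, 0) and principal-axis variances."""
    n = len(points)
    var_x = sum(x * x for x, _ in points) / n
    var_y = sum(y * y for _, y in points) / n
    cov_xy = sum(x * y for x, y in points) / n
    rho = cov_xy / math.sqrt(var_x * var_y)
    # Eigenvalues of the 2x2 covariance matrix: variances along the
    # uncorrelated (u, v) axes. Taking u as the larger is an assumption.
    half_sum = 0.5 * (var_x + var_y)
    half_diff = 0.5 * math.sqrt((var_x - var_y) ** 2 + 4.0 * cov_xy ** 2)
    var_u, var_v = half_sum + half_diff, half_sum - half_diff
    c = math.sqrt(var_v / var_u)
    return {"var_x": var_x, "var_y": var_y, "rho": rho,
            "sigma_u": math.sqrt(var_u), "c": c}

# Hypothetical miss distances (x, y), not taken from the thesis.
sample = [(1.0, 0.5), (-2.0, -1.5), (0.5, 1.0), (-1.0, 0.0), (2.0, 1.5)]
est = model_one_estimates(sample)
print(est)
```

The value of c would then be used to find k for P(k,c) = .5, and the estimate of the CEP follows as k times sigma_u.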


2.3 Estimating the CEP using the Assumption that the Errors in the
x and y Directions are Independent

If it has been assumed that the errors in the x and y directions
are independent, an estimate of the CEP can be obtained by using the
estimators in Model I except that the estimated variances
$\hat\sigma_x^2$ and $\hat\sigma_y^2$ are used instead of the estimated
transformed variances $\hat\sigma_u^2$ and $\hat\sigma_v^2$.

$c^* = \frac{\hat\sigma_{\min}}{\hat\sigma_{\max}}$ where $\hat\sigma_{\min} = \min(\hat\sigma_x, \hat\sigma_y)$ and $\hat\sigma_{\max} = \max(\hat\sigma_x, \hat\sigma_y)$

Table two is entered with P(k*,c*) and c* in order to find k*.

Then this estimate of the CEP = $k^*\hat\sigma_{\max}$.


2.4 Information About the Problems

In the problems which follow, both estimates of the CEP will be
obtained in order to compare the results in the Summary in Section VI.

2.5 Example Problems

The problems, which will be used to compare methods of estimating
the CEP, have been set up in three cases. The first case will have ten
sample points $(x_1,y_1), \ldots, (x_{10},y_{10})$ and is representative
of the point in time where some initial decision may be made as to
whether the
It should be noted that as the number of observations increases, the
estimators are more likely to be closer to the true values.

The positions of the 25 points are shown in diagrams 1, 2, and 3. The
coordinates of the points are as follows:
The value of the CEP obtained using the estimators from this
Section will be compared to the estimates of the CEP from Sections III,
IV, and estimators which are explained in Appendix B. This comparison
will extend to the problem of rejecting outliers and the comparison
will be presented in Section VI.

Although these problems are primarily oriented at tests involving
the more expensive weapon systems, such as the IRBM, the environment
can be extended to less expensive weapon systems which will naturally
have more sample points. Although it was intended to make the problems
as realistic as possible, no attempt was made to utilize data from
actual missile firings.
This model covers the problem of outliers and attempts to show
some of the reasons for eliminating the outliers from consideration in
the determination of the estimates as well as several methods for
eliminating them.

4.1 Introduction to the Problem

The general problem of removing outliers is related to the fact

It seems reasonable that the observation for the first shot should not
be included in the determination of estimates for the CEP.

Also, as improved subsystems are added to the missile, it is
possible that the earlier missiles will not have the same density
function as the later missiles and thus have a different CEP. In this
case, it may become necessary to include only the later developed
missiles in the determination of the CEP. Due to the fact that the
missile development will be a continuing process with each missile
slightly different than the preceding one, it may not be easy to
distinguish between these distributions. This is because both
distributions will have some observations close to the target and others
away from the target. The figures below may help to illustrate this
point.


Observations from Two Distributions about the Target
(legend: first population, second population)
Figure 14a

Density Functions of Two Distributions about the Target
Figure 14b


It should be noted in figure 14b that distribution I has some
probability of occurring in distribution II. If this probability is
large, it may be extremely difficult to separate the two distributions.
In fact, if it is desired to separate the two distributions, there is
some probability that observations belonging to the underlying
distribution under consideration will be removed along with the
observations from the distribution that is not being considered. Thus
one of the problems in removing outliers is to keep the probability that
the observations removed as outliers do in fact belong to the underlying
distribution as low as possible. If this probability is small, it is
possible that the observations belonging to the underlying distribution
which are still removed will have such a low probability of occurrence
that their removal will still lead to a better estimate of the
parameters. This may be especially true for small sample sizes where
one sufficiently large or small observation can totally ruin an analysis
of the data. Therefore, in order to eliminate an arbitrary result, it
is necessary to establish some criteria for eliminating these outlying
observations.


4.2 Criteria for Rejection of Outliers

Naturally, shots which land at long distances from the target can
be easily identified as wild shots or outliers with possible unknown
errors, but as the observations move closer to the target, it becomes
necessary to utilize some type of probabilistic consideration for the
rejection of outlying observations. One way to approach a solution to
this problem is to set it up as a hypothesis testing problem. On the

the hypothesis that the observed point $(x_i,y_i)$ belongs to the
underlying distribution. The test is then conducted for each $(x_i,y_i)$
for i = 1, 2, ..., n, one at a time. The alternate hypothesis is then
that the observed point $(x_i,y_i)$ does not belong to the underlying
distribution but to some different distribution. This can be written as:

$H_0$: $f(x_i,y_i) = f_{X,Y}(x_i,y_i)$
$H_1$: $f(x_i,y_i) \ne f_{X,Y}(x_i,y_i)$

where $f_{X,Y}(x_i,y_i)$ is the true underlying distribution.

The probability of a Type I error will be called v where v is the 
probability of rejecting the hypothesis that the point $(x_i, y_i)$ does
belong to the underlying distribution when in fact it does belong to the
underlying distribution. This can be expressed as

Prob [Type I error] = v

The probability of accepting the hypothesis that some point $(x_i, y_i)$ does
belong to the underlying distribution when the point does not belong to
the underlying distribution is called the probability of a Type II
error.

Thus the probability of the Type I error may be called the risk
that the experimenter is willing to take in making a mistake by rejecting
a point $(x_i, y_i)$ as an outlier which does in fact belong to the
underlying distribution even though the observed value does exceed some
value specified by the criteria. Naturally, it is desirable to try to
keep v small, but if v is too small then the Type II error will increase
and all outliers will be included in the determination of the parameters.
4.3 Method I For the Rejection of Outliers.

One method for the rejection of outliers is based on the probability
that a random point (X,Y) will lie within the ellipse $Z'AZ = k_1^2$. $Z'AZ$ is
the matrix notation for the quadratic form of the dependent bivariate
normally distributed random variables X and Y. That is
$k_1$ is defined by

(4.2)    $P(Z'AZ \le k_1^2) = \iint_{Z'AZ \le k_1^2} f_{X,Y}(x,y)\,dx\,dy = 1 - v$

Geometrically, this is the probability that the point (X,Y) will lie
inside the ellipse made by a plane parallel to the x,y axes cutting the
density function as shown in Figure 15.
Offset Ellipse Made by a Plane Parallel to x,y Axes
Cutting the Density Function

Figure 15


Due to the orientation of this density function, it is necessary to
make the transformation to the orthogonal u,v coordinate system in order
to integrate this form. This transformation is made in the same
manner as in Sections II and III. The probability can now be expressed as
(4.3)    $P(Z'AZ \le k_1^2) = P(W'A^{*}W \le k_1^2) = \iint_{W'A^{*}W \le k_1^2} f_{U,V}(u,v)\,du\,dv = \frac{1}{2\pi\sigma_u\sigma_v} \iint_{W'A^{*}W \le k_1^2} \exp\left(-\tfrac{1}{2} W'A^{*}W\right) du\,dv$

where $W'A^{*}W = \frac{u^2}{\sigma_u^2} + \frac{v^2}{\sigma_v^2}$.

Letting $t = \frac{u^2}{\sigma_u^2} + \frac{v^2}{\sigma_v^2}$, (4.3) reduces to

(4.4)    $P(T \le k_1^2) = \int_{t \le k_1^2} f_T(t)\,dt$

The random variable T has the chi squared distribution with two degrees
of freedom. The above formula is a special case of the following result.





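If $X_i$ are independent and normally distributed random variables with
means $\mu_i$ and variances $\sigma_i^2$, then

(4.5)    $T_m = \sum_{i=1}^{m} \left(\frac{X_i - \mu_i}{\sigma_i}\right)^2$

has the chi squared distribution with m degrees of freedom.
The degrees of freedom m is the number of independent terms in the sum.

The density function of $T_m$ is

(4.6)    $f_{T_m}(t) = \frac{t^{m/2 - 1} e^{-t/2}}{2^{m/2}\,\Gamma(m/2)}, \quad t > 0$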
The areas under this density function are partially tabulated in Table 4.
The desired percentage of the area under this curve is found by entering
Table 4 with 1 - v and the degrees of freedom m.
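
For the two-degree-of-freedom case used in Method I, the table lookup can be checked in closed form, because the chi squared distribution with two degrees of freedom has the distribution function 1 - exp(-t/2). The following is a minimal sketch under that fact; the function names are illustrative and do not come from the thesis.

```python
import math

def chi2_2df_cdf(t):
    """P(T <= t) for a chi squared variable with two degrees of freedom."""
    return 1.0 - math.exp(-t / 2.0) if t > 0 else 0.0

def k1_squared(v):
    """Value k1^2 with P(T <= k1^2) = 1 - v, where v is the Type I error risk."""
    if not 0.0 < v < 1.0:
        raise ValueError("v must lie strictly between 0 and 1")
    return -2.0 * math.log(v)

# Tabulate the critical value for a few illustrative risk levels.
for v in (0.10, 0.05, 0.01):
    print(v, round(k1_squared(v), 3))
```

For other degrees of freedom no such simple closed form exists and a table or numerical routine is still needed.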

The decision rule that is used for the elimination of outliers is
to state that an observation

"Introduction to the Theory of Statistics" by A. M. Mood /2/ of
4.4 Method II For the Rejection Of Outliers.
This method for the rejection of outliers is based on the probability
that a random point (X,Y) will lie within a circle of radius $k_2\sigma_{\max}$.
Then, letting

$r = [(x - \mu_x)^2 + (y - \mu_y)^2]^{1/2}$, $k_2$ is defined by

(4.8)    $P[r \le k_2\sigma_{\max}] = \iint_{r \le k_2\sigma_{\max}} f_{X,Y}(x,y)\,dx\,dy = 1 - v$

Geometrically, it is the probability that the random point (X,Y) will
lie inside the circle imposed on the quadratic form made by a plane
parallel to the x,y axes which cuts the density function as shown in
Figure 16.
Illustration Showing the Circle of Interest
which is Imposed on the Ellipse Made by a Plane
Parallel to the x,y Plane Cutting the Density Function

Figure 16


Due to the orientation of this density function, it is also necessary
to make the transformation to the orthogonal u,v coordinate system.
The geometrical areas under consideration are shown below in Figure 17
for this transformed density function.
Illustration Showing the Circle of Interest
which is Imposed on the Ellipse Made by a Plane
Parallel to the u,v Plane Cutting the Density Function

Figure 17

It should be noted that this method will reject points outside the
circle but inside the ellipse which is estimated from the data points.
Therefore, unless the variances are equal, this method will generally
differ from Method I, since some points on or near the major axis will
be outside the circle as shown in Figure 17. The circle is necessarily
of smaller diameter than the major axis of the ellipse unless the
variances are equal, and then the circle and ellipse will be
synonymous. This can be seen from the following inequality:

(4.9)    $\frac{u^2 + v^2}{\sigma_{\max}^2} \le \frac{u^2}{\sigma_u^2} + \frac{v^2}{\sigma_v^2}$   where $\sigma_{\max} = \max(\sigma_u, \sigma_v)$


The probability that the point (U,V) in the transformed coordinate
system will lie within this circle is

(4.10)    $P[U^2 + V^2 \le k_2^2\sigma_{\max}^2] = \iint_{u^2 + v^2 \le k_2^2\sigma_{\max}^2} f_{U,V}(u,v)\,du\,dv = P(k_2, c)$   where $c = \sigma_{\min}/\sigma_{\max}$

This formula is the same general formula that was used for the
determination of the CEP except that .5 has now been replaced by (1-v)
in the range from .5 to 1. The decision rule that is used for the
elimination of outliers is to state that an observation is an outlier
when

(4.11)    $r_i \ge k_2\hat{\sigma}_{\max}$   where $k_2$ is obtained from Table 2 by entering
with 1-v and the value $c = \hat{\sigma}_{\min}/\hat{\sigma}_{\max}$. It should be noted that this value
of k defines the radius of the circle centered at $(\mu_x, \mu_y)$ which
includes (1-v)100% of the bivariate probability mass. The value of k
obtained from Method I defines the ellipse which includes (1-v)100% of the
the parameters re
If it is assumed

be used. Also, the criterion
of relative efficiency can be used to determine whether Model I or
Model II should be used. The estimates of the parameters
$\hat{\mu}_x, \hat{\mu}_y, \hat{\sigma}_x, \hat{\sigma}_y, \hat{\rho}$ are then computed by using the selected model.
The estimated value of the matrix A is computed next using the above
estimates.
Normally the value v is predetermined by the experimenter and the
outlier rejected on the basis of this value. It is advisable to delete
the outliers one at a time until all of the data points are inside the
region prescribed by the probability 1-v and the method used. This is
due to the fact that the estimated shape of the curve is dependent upon
the data points and each deleted point will produce some change in the
If there are two or more points which satisfy this requirement, the
point which has the greatest valued quadratic form is deleted first.
It is then necessary to recompute the estimators and use the above
procedures again, thus removing outliers one at a time, until there are
no points left with estimated quadratic forms greater than $k_i^2$ (i = 1 or 2).
The final estimate of the CEP is then determined from the estimators
derived using the data from the remaining observations. This estimate
of the CEP will be referred to as $\widehat{CEP}_i$, where the subscript i refers
to the number of data points removed.
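
The one-at-a-time procedure for the elliptical rule (Method I) can be sketched as follows. This is only an illustration under stated assumptions: the parameters are estimated with sample means and an n - 1 divisor, the threshold is the two-degree-of-freedom chi squared value -2 ln v, and the function names and impact points are hypothetical, not taken from the thesis problems.

```python
import math

def estimate(points):
    """Sample means, variances (n - 1 divisor) and correlation of (x, y) pairs."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    vx = sum((x - mx) ** 2 for x, _ in points) / (n - 1)
    vy = sum((y - my) ** 2 for _, y in points) / (n - 1)
    cov = sum((x - mx) * (y - my) for x, y in points) / (n - 1)
    return mx, my, vx, vy, cov / math.sqrt(vx * vy)

def quadratic_form(point, mx, my, vx, vy, rho):
    """Estimated quadratic form of one point for a correlated bivariate normal."""
    zx = (point[0] - mx) / math.sqrt(vx)
    zy = (point[1] - my) / math.sqrt(vy)
    return (zx * zx - 2.0 * rho * zx * zy + zy * zy) / (1.0 - rho * rho)

def reject_outliers(points, v, min_points=3):
    """Delete the worst point, re-estimate, and repeat until none exceeds k1^2."""
    k1_sq = -2.0 * math.log(v)  # chi squared (2 d.f.) value with upper tail v
    kept, removed = list(points), []
    while len(kept) > min_points:
        params = estimate(kept)
        worst = max(kept, key=lambda p: quadratic_form(p, *params))
        if quadratic_form(worst, *params) <= k1_sq:
            break
        kept.remove(worst)
        removed.append(worst)
    return kept, removed

# Hypothetical impact points: a tight cluster plus one wild shot.
shots = [(1.0, 0.0), (-1.0, 0.0), (0.0, 1.0), (0.0, -1.0),
         (1.0, 1.0), (-1.0, -1.0), (1.0, -1.0), (-1.0, 1.0), (10.0, 10.0)]
kept, removed = reject_outliers(shots, v=0.05)
print(removed, len(kept))
```

Re-estimating after every deletion matters here: with the wild shot included the estimated correlation is large, and it returns to zero once the shot is removed. The sketch does not guard against degenerate data (zero variance or perfect correlation).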


Information About the Problems.
In order to illustrate the above methods, the three sample problems given
in Section 2 were used. Model II was chosen arbitrarily for estimating
the parameters for illustrative purposes. Both methods of rejecting
outliers were set up for each problem case but instead of rejecting
outliers for any specific probability, the tables were set up to show
Data Points in Problem 1, N = 10
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‘O% probability by Method 1 and 95% probability by Method.2, 
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removed as an outlier by method 1. For purposes of illustration, this 
5.1 Introduction
The previously introduced estimates of the CEP are all called
point estimates where the estimate of the CEP was defined by the locus
of a point moving at a constant distance (the radius) from a fixed
point (called the mean or $(\mu_x, \mu_y)$). The constant distance or radius
is called the CEP. The confidence interval of the CEP attempts to
give some measure of the possible error in the estimate of the CEP.
The confidence is defined as the probability that the true value of
the CEP lies in an interval between $L_1$ and $L_2$ where $L_1$ and $L_2$ are
functions of the random observations $(x_1, y_1), (x_2, y_2), \ldots, (x_n, y_n)$.
The expression in probability notation is
This interval estimate is a function of the confidence required, the
number of observations, and the estimate of the standard deviation used.

5.2 Obtaining the Interval Estimate

In order to avoid lengthy computation in obtaining the interval
estimate, it is assumed that the variances are equal. That is
$\sigma_x = \sigma_y = \sigma$.

The CEP was defined in Section 1.2 as being equal to $k\sigma$ where the
value k is a function of the ratio of the variances and the probability
that the mean centered circle contains 50% of the bivariate density mass.
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See Section 2,! 
Since the variances are assumed to be equal, the ratio of the variances
is 1, and P(k,1) = .5, so that k = 1.1774 (from Table 1 with c = 1).
Although the variances are assumed to be equal, the estimates of the
variances are not necessarily equal.
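
The tabulated value k = 1.1774 can be reproduced numerically. The sketch below assumes that P(k, c) is the probability that a zero-mean, uncorrelated bivariate normal point falls inside a circle of radius k times the larger standard deviation, with c taken as the ratio of the smaller to the larger standard deviation. That reading of c is an assumption, and the function names are illustrative rather than the thesis's own.

```python
import math

def circle_probability(k, c, steps=400):
    """P(X^2 + (c*Y)^2 <= k^2) for independent standard normal X, Y and 0 < c <= 1."""
    # Substituting x = k*sin(theta) keeps the integrand smooth on [-pi/2, pi/2].
    def integrand(theta):
        x = k * math.sin(theta)
        half_chord = k * math.cos(theta)
        density = math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)
        return density * math.erf(half_chord / (c * math.sqrt(2.0))) * half_chord

    a, b = -math.pi / 2.0, math.pi / 2.0
    h = (b - a) / steps  # steps must be even for Simpson's rule
    total = integrand(a) + integrand(b)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(a + i * h)
    return total * h / 3.0

def radius_factor(p, c, lo=0.0, hi=10.0):
    """Solve P(k, c) = p for k by bisection."""
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if circle_probability(mid, c) < p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

print(round(radius_factor(0.5, 1.0), 4))
```

With p = 0.5 this gives the CEP factor; with p = 1 - v the same routine gives a factor of the kind used for the circular rejection rule of Method II.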
The estimate of the standard deviation will be determined by the
following two methods.


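5.2.1 Determining the Confidence Interval, Method 1

In this method $\hat{\sigma}^2 = \max(\hat{\sigma}_x^2, \hat{\sigma}_y^2)$ will be selected to represent $\sigma^2$.
That is

(5.2)    $\hat{\sigma}^2 = \max\left[\frac{\sum_{i=1}^{n}(x_i - \bar{x})^2}{n-1},\ \frac{\sum_{i=1}^{n}(y_i - \bar{y})^2}{n-1}\right]$

If $\hat{\sigma}^2$ is divided by the true value of the parameter and multiplied by
n-1, this formula becomes

(5.3)    $\frac{(n-1)\hat{\sigma}^2}{\sigma^2} = \max\left[\frac{\sum_{i=1}^{n}(x_i - \bar{x})^2}{\sigma^2},\ \frac{\sum_{i=1}^{n}(y_i - \bar{y})^2}{\sigma^2}\right]$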


Although the sum in (5.3) will not be an exact chi squared random
variable because it is the maximum of two chi squared random variables, an
approximate confidence interval can be obtained by treating (5.3) as
though it were a chi squared random variable.

The confidence interval defined by (5.1) thus becomes
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eee 4 lee) atta cither l=e 280280 2 respectively. 
The estimate of the variance in this method is the average of the
It should be noted that the confidence interval becomes smaller as the
number of observations increases. This implies that the true value of
the CEP is more likely to be within a smaller interval as the number of
observations increases.
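
The shrinking of the interval with sample size can be illustrated with the textbook chi squared interval for a standard deviation, scaled by k = 1.1774. This is a sketch under assumptions: the thesis's own interval formula is not legible in this copy, n - 1 degrees of freedom are used as in (5.3), and the chi squared quantile is replaced by the Wilson-Hilferty approximation to keep the example self-contained. All names are illustrative.

```python
import math
from statistics import NormalDist

def chi2_quantile(p, m):
    """Wilson-Hilferty approximation to the p-quantile of chi squared with m d.f."""
    z = NormalDist().inv_cdf(p)
    a = 2.0 / (9.0 * m)
    return m * (1.0 - a + z * math.sqrt(a)) ** 3

def cep_interval(sigma_hat, n, confidence=0.95, k=1.1774):
    """Approximate (L1, L2) for the CEP = k*sigma, equal variances assumed."""
    alpha = 1.0 - confidence
    m = n - 1  # degrees of freedom (assumption, following (5.3))
    upper_q = chi2_quantile(1.0 - alpha / 2.0, m)
    lower_q = chi2_quantile(alpha / 2.0, m)
    lower = k * sigma_hat * math.sqrt(m / upper_q)
    upper = k * sigma_hat * math.sqrt(m / lower_q)
    return lower, upper

# Same estimated standard deviation, increasing number of observations.
for n in (10, 30, 100):
    print(n, cep_interval(10.0, n))
```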
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the different methods, some comprehensive distribution theory problems 
SUMMARY

6.1 Introduction

The previous sections have been concerned with the development of
different types of models and methods for estimating the radius of the
mean centered circle which includes 50% of a bivariate probability
mass. This section summarizes the different models and methods used
in the previous sections, and includes an analysis of the results
obtained from problems. Although the sample problems do not represent
actual missile test results, an attempt has been made to make the data
as realistic as possible. Therefore an analysis of the problems
should show certain relationships between the models used to estimate
the CEP that would also apply to actual missile test data.


6.2 Comparison of Model I with Model II.

The basic underlying assumption made in Model I was that the true
value of the mean was located at the target, (0,0). Therefore, the
CEP in this model is defined as the radius of a circle around the
target.

The basic underlying assumption made in Model II was that the true
value of the mean was located at some point $(\mu_x, \mu_y)$ away from the
target. Therefore, the estimated CEP for this model is the radius of
a circle with center at some point, $(\bar{x}, \bar{y})$.
A comparison of the estimate of the correlation coefficient shows
that they change in much the same manner in both models. As suspected,
a major difference between these models is in the location of $\bar{x}$ and $\bar{y}$.
This is seen in Diagrams 31, 32, and 33 which illustrate the estimates





for problems 2 and 3 is practically
the same in all three cases. Therefore, when the center of the
distribution is near the target, there is little practical difference
between the two models. However, in problem 1, the distribution of data
points is around some point $(\bar{x}, \bar{y})$ away from the center. If the
procedure given in Appendix B is used to estimate the ratio function,
then the values obtained indicate that the Model II estimate gives the
best estimate of the CEP for a sample size of 10 in problem 1. Also, as
the sample size increases the ratio function increases, thus the Model II
estimate is also the best estimate for n > 10. The values of R.F.
obtained for problems 2 and 3 show a preference for Model I for small
sample sizes and are very close to 1 for large sample sizes and
therefore either estimate may be used.

These problems tend to substantiate the fact that the procedure of
Model II is superior to the procedure of Model I in large sample sizes.
They also suggest that if Model I is used in analyzing a small number
of observations, it might be advantageous to check the assumption of
mean (0,0) by computing the sample means.
In the introduction to the problem of estimating the CEP, the
assumption was made that the errors in the x and y directions were not
independent. This assumption is natural unless an a priori knowledge
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SULTS 
However, the assumption of independence in the fire control problem is
more difficult to justify due to its complexity. Therefore, it would
seem wise to estimate the magnitude of the error involved in assuming
independence in order to find out how much difference this assumption
will mean in the determination of the CEP.

It was shown in Appendix A that the true orientation of the density
function was related to the correlation coefficient. If the true shape
of the density function is oriented at some angle with respect to the
x and y axes and independence is assumed, the computed standard
deviation is not the best estimate of the standard deviation. Consequently
the independence assumption introduces an additional error in the
estimate of the CEP.

Table £ is used to illustrate some of the important differences 


in the results obtained from the problems using the two models.
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Model 1: {I (edo I = fodel D1) 
The table shows some difference in the magnitude of the radius of
the CEP as estimated by the two models with the maximum difference being
3.37%. Also, the trend in the size remains constant between
the two models. That is, as the size of the estimated CEP changes in
one model, it changes in the other model in the same direction.

Graph 1 shows a plot of the percent difference in the independent
and dependent estimates versus the correlation coefficient. It should be
emphasized that the points on the graph were obtained from data computed
The differences in the estimates of the CEP from the problems are
shown in Diagrams 34, 35, and 36. Some of the differences were so
small that these estimates were left off. It is interesting to note
that the distribution of data in Problem 3 shows almost perfect correla-
6.4 Effects Caused By The Removal Of Outliers.

The most obvious effect on the CEP when outliers are removed is
that the CEP becomes smaller. However, there are several other effects
which are not obvious but may be important in determining which
estimators can be used. Tables, using the sample problems, give a comparison
between Method I and Method II and the estimates of certain parameters
before and after removal of outliers.
The estimate of the CEP was reduced by from 14% to 36% in the
problems by the removal of outliers. If a probability 1-v of .95 (a
Type I error probability of .05) had been specified, the point rejected
as an outlier in problem 1 would not have been rejected by the
elliptical method but would have been rejected by Method II. This is
because Method I and Method II are not the same and will not necessarily
reject the same points for the same confidence level. The effects of
removing outliers are shown in Diagrams 37, 38, and 39.
It should be noted that the removal of outliers may change both
the correlation coefficients and the difference between the standard
deviations in the x and y directions. This is due to the large effect
that an outlier has upon the distribution parameters. Thus a large
correlation coefficient may be due to the presence of an outlier and
not due to correlation between the errors in the x and y directions.
Therefore, before the independent method of estimation is rejected, an
investigation should be made for outliers.
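
The effect of a single outlier on the estimated correlation coefficient can be seen in a small numerical sketch. The data are hypothetical: eight uncorrelated impact points plus one wild shot, chosen only for illustration and not taken from the thesis problems.

```python
import math

def correlation(points):
    """Sample correlation coefficient of a list of (x, y) pairs."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    sxx = sum((x - mx) ** 2 for x, _ in points)
    syy = sum((y - my) ** 2 for _, y in points)
    sxy = sum((x - mx) * (y - my) for x, y in points)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical cluster with no correlation between the x and y errors.
cluster = [(1.0, 0.0), (-1.0, 0.0), (0.0, 1.0), (0.0, -1.0),
           (1.0, 1.0), (-1.0, -1.0), (1.0, -1.0), (-1.0, 1.0)]
with_outlier = cluster + [(10.0, 10.0)]

print(round(correlation(cluster), 3), round(correlation(with_outlier), 3))
```

In this example the single wild shot alone produces an estimated correlation above 0.9 in data that are otherwise uncorrelated.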
APPENDIX A

ORIENTATION AND TRANSFORMATION OF THE BIVARIATE DENSITY FUNCTION

A.1 Introduction

This appendix is concerned with the orientation of the bivariate
normal density function over the x,y plane. Primarily this requires an
investigation of the correlation between the random variables X and Y
and, once the correlation is determined, a transformation of axes so
that the function can be integrated more easily.
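
As an illustration of such a transformation of axes, the sketch below rotates to the principal axes of the covariance matrix, where the transformed variables are uncorrelated. It uses the standard eigenvalue result for a 2 x 2 covariance matrix; the function names and the sign convention for the angle are assumptions and may differ from the derivation in this appendix.

```python
import math

def principal_axes(sigma_x, sigma_y, rho):
    """Rotation angle and standard deviations along the principal (u, v) axes."""
    cov = rho * sigma_x * sigma_y
    vx, vy = sigma_x ** 2, sigma_y ** 2
    theta = 0.5 * math.atan2(2.0 * cov, vx - vy)  # angle of the major axis
    mean_var = 0.5 * (vx + vy)
    spread = math.hypot(0.5 * (vx - vy), cov)
    sigma_u = math.sqrt(mean_var + spread)            # larger standard deviation
    sigma_v = math.sqrt(max(mean_var - spread, 0.0))  # smaller standard deviation
    return theta, sigma_u, sigma_v

def to_uv(x, y, mu_x, mu_y, theta):
    """Coordinates of (x, y) in the rotated, mean-centered u,v system."""
    dx, dy = x - mu_x, y - mu_y
    u = dx * math.cos(theta) + dy * math.sin(theta)
    v = -dx * math.sin(theta) + dy * math.cos(theta)
    return u, v

theta, sigma_u, sigma_v = principal_axes(1.0, 1.0, 0.5)
print(math.degrees(theta), sigma_u ** 2, sigma_v ** 2)
```

When the correlation is zero and the x variance is the larger one, the angle is zero and the u,v axes coincide with the x,y axes.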
A.2 Orientation of the Axes

If the correlation coefficient is zero, that is, the random variables
X and Y are independent, the orientation will be symmetrical with
respect to the x and y axes. This means that a plane parallel to the
x,y plane will cut the density function in the form of an ellipse
whose minor and major axes are parallel to the x and y axes. This is
shown below in Figure A-1. Note that if $\sigma_x = \sigma_y$, the ellipse becomes a
circle.

Orientation of the Ellipse when $\rho = 0$.

Figure A-1

If the correlation coefficient is not zero and is less than 1 in
absolute value, the orientation is offset from the x and y axes in the direction
B.1 Introduction

The theory of estimation is concerned with the problem of finding
functions of the observations such that the distribution of these
functions will be concentrated as closely as possible near the true
values of the parameters estimated. The density function of the
observations under consideration was described in Section I and the
parameters which are to be estimated are $\mu_x, \mu_y, \sigma_x, \sigma_y$ and $\rho$.
Some of the properties which are desired of the estimators were
described in Section 1.3.
B.2 Maximum Likelihood Estimation
is easier to maximize the logarithm of the likelihood function rather
logarithm has its maximum at the same point as does the likelihood
The maximum likelihood estimate of each of the unknown parameters
is obtained by setting the derivative of the function with respect to
each of the unknown parameters equal to zero and then solving the
resulting equations simultaneously. To illustrate this procedure, the
The expected value of the estimator in formula (B.6) is obtained
by recognizing the fact that there are 2(n-1) independent squares in
the sum and therefore $2n\hat{\sigma}^2/\sigma^2$ is a chi squared random variable with
2(n-1) degrees of freedom as defined in formula (4.5). Since the
expected value of a chi squared random variable is equal to its degrees
It should be noted that the estimators (B.10) are used in Model II.
Also, if the assumption is made that the true values of the means are
zero, then the estimators for the variances in Model I are also unbiased.
The estimators of the means and variance used in Model I and Model II
are, apart from the biasing factors, maximum likelihood estimators.

However, it should be noted that the CEP is a function of the
standard deviation and not the variance. The following section will
determine unbiased estimators for the CEP using the procedure in this
section.
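
The distinction between the variance and the standard deviation can be made concrete with the standard single-variable result for normal samples: the square root of the unbiased variance estimate underestimates the standard deviation on average by a factor usually written c4(n). The sketch below computes that factor. It is the textbook correction, offered under that assumption, and is not necessarily the form of the estimators derived in this appendix.

```python
import math

def c4(n):
    """E[S] / sigma for the sample standard deviation S (n - 1 divisor), normal data."""
    log_ratio = math.lgamma(n / 2.0) - math.lgamma((n - 1) / 2.0)
    return math.sqrt(2.0 / (n - 1)) * math.exp(log_ratio)

def unbiased_sigma(s, n):
    """Bias-corrected estimate of sigma from the sample standard deviation s."""
    return s / c4(n)

# The correction is noticeable for small samples and fades as n grows.
for n in (2, 5, 10, 50):
    print(n, round(c4(n), 4))
```

Since the CEP is proportional to the standard deviation under the equal-variance assumption, the same factor would carry over to a CEP estimate built from a single sample standard deviation.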
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A possible procedure for using Table 1 is as follows: 
The values of $\bar{x}$ and $\bar{y}$ are first computed. Then the two estimates of the
standard deviation are computed using formulas (B.11) and (B.15) respectively. The estimated
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attempt has been made to theoretically justify this procedure, 
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Analysis of Table m Using the Above Values 


1 for k = 484, Ref. >1 for all n>5, therefore CEB is best 
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